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Abstract 

There is a plethora of published cancer biomarkers but the reality is that very few, if any, new circulating cancer 
biomarkers have entered the clinic in the last 30 years. I here try to explain this apparent oxymoron by classifying 
circulating cancer biomarkers into three categories: fraudulent reports (rare); true discoveries of biomarkers, that 
then fail to meet the demands of the clinic; and false discoveries, which represent artifactual biomarkers. I further 
provide examples of combinations of some known cancer biomarkers that can perform well in niche clinical 
applications, despite individually being not useful. 
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Background 

There is wide debate recently as to why very few, if any, 
new circulating cancer biomarkers have entered the 
clinic in the last 30 years. The vast majority of clinically 
useful cancer biomarkers were discovered between the 
mid-1960s (for example, carcinoembryonic antigen, 
CEA) and the early 1980s (for example, prostate-specific 
antigen (PSA) and carbohydrate antigen 125 (CA125)). 
It is true that major investments by both academia and 
industry have been made in this area of investigation 
but with very little return. One wonders why this hap- 
pens in an era of spectacular technological advances. 
Some argue that it is because the problem is very com- 
plex and underestimated; others blame reasons such as 
a lack of understanding of the pathobiology of cancer, 
not enough funding, use of inappropriate samples for 
discovery and validation, methodological limitations, and 
so on [1,2]. 

Here, I will provide a brief and simplified analysis as 
to why this may be happening. We should keep in mind 
that the high failure rates in the biomarker field are no 
different from those of therapeutics. But there is an 
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important difference. Therapeutics leading to relatively 
small improvements in patient survival (weeks to 
months) are likely to be marketed, while diagnostics 
with relatively small improvements in patient diagnosis 
or prognosis will likely fall by the wayside. Hence, simi- 
lar advances in therapeutics and diagnostics can be 
hailed as 'successes' in the former and 'failures' in the 
latter. 

Why do most biomarkers fail to reach the clinic? 

One way of analyzing the apparent failures in diagnos- 
tics is by classifying them into three distinct categories 
(Figure 1). The first category of failing biomarkers 
includes those that are based on fraudulent publications. 
These are quite rare and, despite some highly-publicized 
cases [3], fraud is not the major reason for failing 
biomarkers. 

The second and largest category of biomarkers that 
never reach the clinic (and counted as failures) includes 
those that have been discovered and validated by using 
robust and reliable techniques (true discovery). These 
biomarkers have successfully gone through the process 
of discovery and validation, with reproducible and con- 
cordant results between studies, from early to late 
stages. However, these biomarkers fall short in their 
ability to contribute decisively to patient care, except for 
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Why Biomarkers Fail to Reach the Clinic 
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True Discovery False Discovery 








Weak Clinical Performance Original claims fail validation 



IS 



■ Low specificity 

■ Low sensitivity 

■ Low prognostic/predictive value 

■ Information not necessary for 
clinical decision-making 



Shortcomings 

■ Pre-analytical 

■ Analytical 

■ Post-analytical 

■ Bioinformatic 



Figure 1 Summary of reasons for biomarker failure to reach the clinic. Fraud is a very rare reason for biomarker failures; most biomarkers 
represent true discoveries but their clinical characteristics are not good enough to be used at the clinic. 



providing some incremental, but clinically not essential, 
information. For example, the urokinase plasminogen 
activator/plasminogen activator inhibitor 1 (uPA/PAI 1) 
combination of biomarkers has long been regarded as a 
prognostic indicator of breast carcinoma, and many ret- 
rospective and prospective studies and meta-analyses 
have confirmed their prognostic value [4]. However, 
they have not been widely adopted, especially in North 
America, despite availability of excellent ELISA meth- 
odologies for their measurement, because clinicians do 
not seem to find this information necessary in deciding 
how to treat their patients. Rather, they decide on treat- 
ment options without this information, thus saving 
costs. Clinicians usually prefer to over-treat some 
patients, instead of using prognostic biomarkers with 
less than perfect prediction. Imperfect prognostic bio- 
markers could spare a fraction of patients from over- 
treatment (true positives), but at the cost of not treating 
some patients who could benefit from treatment (false 
negatives). Another example is the tumor suppressor 



p53. A search in PubMed for the term 'p53 AND breast 
cancer prognosis' identifies 1470 papers, with the vast 
majority confirming the prognostic value of p53, despite 
its sparse use at the clinic, if any, due to the reasons 
mentioned above (imperfect or weak prognostic value). 

Other examples in this category include those biomar- 
kers that have been discovered and validated thoroughly 
by industry, and although found to have some use in clin- 
ical prediction, the strength of their predictive ability is 
not enough to persuade clinicians to use them, or clinical 
practice guideline developers to recommend them. An 
example is the novel ovarian cancer biomarker, B7-H4 
(discovered by diaDexus Inc., San Francisco, CA, USA), 
which was validated for its ability to diagnose ovarian 
cancer [5]. Recently, an independent group confirmed 
the diagnostic ability of this biomarker, but also demon- 
strated that it is not better than the classical biomarker, 
CA125 [6]. Consequently, the company that discovered 
it, in the absence of a clear clinical utility, decided not to 
market it. It is quite expensive to conduct the necessary 
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clinical trials to obtain approval from the Food and Drug 
Administration (FDA). There are numerous examples of 
this sort in the literature, that is, of reasonable and work- 
ing biomarkers that fall short of fulfilling a clear clinical 
need and thus unlikely to be profitable if marketed. 

There are also numerous examples of biomarkers 
which have good sensitivity (>70% at 90% to 95% speci- 
ficity) in detecting late carcinoma but poor sensitivity in 
detecting disease in asymptomatic patients, especially at 
the extremely high specificity required for screening (for 
example, >99.5%, as is the case with ovarian carcinoma) 
[6]. Consequently, none of the available ovarian cancer 
biomarkers are suitable for screening, and it may be 
unlikely that we will find any that can perform at these 
clinically dictated and highly demanding specifications 
(for example, >80% sensitivity for early and asympto- 
matic disease, at > 99.5% specificity; to achieve a reason- 
able positive predictive value of >10%). 

From this discussion, it can be concluded that a very 
large number of candidate biomarkers have been discov- 
ered, and have been confirmed by reliable methods to 
provide diagnostic, prognostic or predictive information 
in certain groups of patients. Unfortunately, this infor- 
mation could not be translated into action for better 
patient management and outcomes. So, work on bio- 
markers is continuing. However, these biomarkers are 
not recommended in practice guidelines for use in the 
diagnosis or treatment of cancer [7], despite their statis- 
tically significant (but clinically not useful) diagnostic, 
predictive or prognostic information. Some examples of 
well-validated biomarkers and their possible reason of 
failure at the clinic are outlined in Table 1. 

There is another group of biomarkers which may initi- 
ally look highly promising (or even revolutionary) but 
for which shortcomings have been identified, either at 
the discovery or validation phase. For example, in my 
previous commentary [8], I identified pre-analytical, ana- 
lytical and post-analytical shortcomings of many 



published biomarkers, which could invalidate the origi- 
nal performance claims. This group of biomarkers 
should be considered as 'false discovery'. They will not 
reach the clinic because the original performance claims 
could not be independently reproduced in subsequent 
validation studies. 

Future prospects: how do we overcome failures? 

In light of the analysis above, it can be concluded that the 
failure of the myriad of biomarkers to reach the clinic, 
excluding fraud, is either due to inadequate performance 
in a clinical setting or to false discovery. There are at 
least two ways to improve this situation. For those bio- 
markers with a well-validated performance, but which 
are not good enough for clinical use, it may be possible 
to identify clinical scenarios for which the markers could 
still help, in combination with other clinical or biomarker 
data. For example, human epididymis protein 4 (HE4), a 
new ovarian cancer biomarker, is not superior to CA125 
for diagnosis of ovarian carcinoma [6] but is more speci- 
fic. A combination of CA125 and HE4, through an algo- 
rithm, was found to be useful in the investigation of 
malignant versus benign pelvic masses [9]. Another test 
combines serum CA125 with a few other proteomic bio- 
markers which, individually, are not useful [10]. Both 
tests have recently received FDA approval (2011; 2009, 
respectively) for this specific application, thus reaching 
the clinic, in combinations, but not individually. There 
are examples of similar applications of otherwise not very 
informative diagnostic biomarkers, such as in the investi- 
gation of a computed tomography-positron emission 
tomography-identified indeterminate lung masses, for 
which a combination of already-known (but individually 
not useful) biomarkers may help [11]. Similar examples 
of 'niche unmet needs' include the identification of bio- 
markers to assess the risk of malignancy for thyroid 
nodules, specifically those with indeterminate results 
after standard thyroid biopsy; identification of biomarkers 



Table 1 Why well-validated biomarkers still fail to reach the clinic? 



Clinical application 



Reason to Fail 



Example 



Population screening and 
diagnosis 



Prognosis 



Monitoring 

Evaluating therapeutic 
response 



■ screening does not save lives; overdiagnosis/overtreatment 

• too many false positives leading to unnecessary and invasive 
confirmatory procedures 

• marker not profitable if marketed 

• weak prognostic value; clinicians prefer to overtreat instead of 
undertreat 

• no effective therapy available 

• no therapy for relapsing disease 

• many false positives and false negatives 



PSA and prostate cancer screening 

• CA 125 for ovarian cancer screening 

■ B7-H4 for ovarian cancer diagnosis 

• p53 and uPA/PAIl for breast cancer 

• CA 1 9.9 for pancreatic cancer 

■ CA125 for detection of early relapse of 
ovarian cancer 

• CA 1 5.3 for breast cancer 



CA: carbohydrate antigen; PSA: prostate-specific antigen 
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Table 2 Niche applications of combinations of biomarkers with Food and Drug Administration approval or with 
potential in the future (unmet needs) 



l_limcdl application 


Biomarkers 


JldlUS 


Investigation of pelvic mass 


■ CA 125 and HE4 (+ ROMA) 3 


• FDA-approved (2011) 




• CA 125 and 7 proteomic markers (+ Danish- 


• FDA-approved (2009) 




Index) 13 




Improve PSA specificity in screening 


• Serum PSA and urine PCA-3 and TMPRSS2-ERG 


■ Pending FDA 




fusions c 


approval 


Separate indolent from aggressive prostate cancer 


• PTEN loss c 


• More research 




• TMPRSS2-ERG fusions 3 


necessary 8 


Assess risk of malignancy of thyroid nodules with indeterminate results 


• HBME-1, Galectin-3 and CK19 d 


• More research 


on biopsy 




necessary 


Assess risk of malignancy of CT (± PET) imaging-identified indeterminate 


■ CEA, CYFRA 21-1, SCC, CA15.3, Pro-GRP, NSE 


• More research 


ung masses 




necessary 



a See [9] for details; b see [10] for details; c see [12,15] for details; d see [16,17] for details. HBME-1 is an antimesothelial cell monoclonal antibody; e PCA-3 was 
approved by FDA (2012) to help determine the need for repeat prostatic biopsy in men who had a previous negative biopsy. See also [18]. CA: carbohydrate 
antigen; CT: computed tomography; CK19: cytokeratin 19; CYFRA 21-1: cytokeratin fragment 21-1; FDA: Food and Drug Administration; HBME-1: human bone 



marrow endothelial 1; HE4: human epididymis protein 4; NSE: neuron-specific enolase; PET: positron emission tomography; pro-GRP: pro-gastrin releasing peptide; 
PCA3: prostate cancer gene 3; PSA: prostate-specific antigen; PTEN: phosphatase and tensin homolog; ROMA: risk of ovarian malignancy algorithm; TMPRSS2-ERG: 
transmembrane protease, serine 2 fTMPRSS2J and v-ets erythroblastosis virus E26 oncogene homolog (avian) (ERG) 



to improve on PSA's specificity in screening (such as 
prostate cancer gene 3) [12]; or, following diagnosis of 
prostate cancer, identification of biomarkers that discern 
between indolent versus aggressive prostate cancers. 
Table 2 summarizes these niche unmet needs. 

Another critical question is what could be done to 
avoid false discovery. Recommendations for this problem 
have been proposed elsewhere [2,8] and include under- 
standing and avoidance of pre-analytical shortcomings; 
careful study design to avoid bias [13,14]; use of analyti- 
cal methodologies that are sensitive, specific and precise; 
selecting appropriate samples (in numbers and quality) 
and patient subgroups for validation; and application of 
robust and rigorous statistics to avoid data over-fitting. 

Conclusion 

In theory, biomarkers can serve many clinical needs 
from risk stratification to prognosis, screening, diagno- 
sis, monitoring, patient subclassification, assessment of 
drug toxicity and prediction of therapeutic response. To 
bring biomarkers to the clinic, it is mandatory to show a 
useful clinical application that is supported by the vali- 
dation data. Only then will diagnostic companies invest 
the necessary (and very significant) funds to conduct 
multicenter clinical trials to show efficacy and receive 
FDA approval. In conclusion, between the two groups of 
biomarkers (false discovery and true discovery), the for- 
mer can be considered a failure but the latter should 
not. Distinguishing between these two categories is 
important, since true discovery is based on good science 
and statistically significant and reproducible data, while 
false discovery is based on bad science. As shown with 
examples, some fruits of true discovery can be combined 
to design incremental but clinically useful and FDA- 
approved improvements (Table 2). Eventually, the time 



will likely come when biology and technology will 
advance sufficiently to catalyze the needed and much 
anticipated leap in cancer biomarkers. 
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